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Abstract 



We study the question of whether every P set has an easy (i.e., polynomial- 
time computable) census function. We characterize this question in terms of 
unlikely collapses of language and function classes such as #P 1 C FP, where 
#P 1 is the class of functions that count the witnesses for tally NP sets. We 

prove that every #Pf H function can be computed in FP* Pl 1 . Consequently, 
every P set has an easy census function if and only if every set in the polynomial 
hierarchy does. We show that the assumption C FP implies P = BPP and 
PH C MODfcP for each k > 2, which provides further evidence that not all sets 
in P have an easy census function. We also relate a set's property of having an 
easy census function to other well-studied properties of sets, such as rankability 
and scalability (the closure of the rankable sets under P-isomorphisms) . Finally, 
we prove that it is no more likely that the census function of any set in P can 
be approximated (more precisely, can be n°-enumerated in time rfi for fixed a 
and (3) than that it can be precisely computed in polynomial time. 



1 Introduction 



Does every P set have an easy (i.e., polynomial-time computable) census function? Many 
important properties similar to this one were studied during the past decades to gain more 
insight into the nature of feasible computation. Among the questions that were previously 
studied are the question of whether or not every P set has an easy to compute ranking 
function [GS91,HR90], whether every P set is P-isomorphic to some rankable set |GH96| , 
whether every sparse set in P is P-printable [HY84 , AR88 , RRW94 1 , whether every infinite 
set in P has an infinite P-printable subset [|AR88 , HRW97a | , whether every P-printable set is 



P-isomorphic to some tally set in P [AR8J], and whether every P set admits easy certificate 
schemes [ |HRW97a , pRW97b| , to name just a few. Some of those questions arise in the 
field of data compression and are related to Kolmogorov complexity, some are linked to the 
question of whether one-way functions exist. 

Extending this line of research, the present paper studies the complexity of computing 
the census functions of sets in P. Census functions have proven to be a particularly 
important and useful notion in complexity theory, and their use has had a profound impact 
upon almost every area of the field. In particular, this regards the extensive literature related 
to the isomorphism conjecture of Berman and Hartmanis (e.g., [BH77, [Mah82 ], and many 
other papers), the work on the existence of Turing-hard sparse sets (or of polynomial-size 
circuits) for various complexity classes (e.g., [ KL80| ,KS85,BBS86,HR97|), the results relating 



the computation times for NP sets to their densities and the results on P-print ability [HY84 
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results on positive relativization and relativization to sparse oracles (e.g., [ Lon85| , [LS86| , 
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BBS86| 1), the unexpected collapse of the strong exponential-time hierarchy [ Hem8£ ] , and 
applications to extended lowness [HJRW]. 



Valiant, in his seminal papers | Val79a , Val79b |, introduced #P, the class of functions 
that count the solutions of NP problems, and its tally version for which the inputs 
are given in unary. Although #Pi has not become as prominent as #P, it contains a 
number of quite interesting and important problems such as the problem Self -Avoiding 
Walk (see RWel93l ): Given an integer n in unary, compute the number of self-avoiding 
walks on the square lattice having length n and rooted at the origin. Self -Avoiding Walk 
is a well-known classical problem of statistical physics and polymer chemistry, and it is 
an intriguing open question whether Self -Avoiding Walk is ^Pj^-complete (see [Wel93]). 
Known problems complete for #Pi |Val79b] have the form: Given an integer n in unary, 
compute the number of graphs having n vertices and satisfying a fixed graph property ir. 

In Section we will characterize the question of whether every P set has an easy census 
function in terms of collapses of language and function classes that are considered to be 
unlikely. In particular, every P set has an easy census function if and only if #Pi Q FP. The 
main technical contribution in Section || is Theorem |3.7j : ^Pf H is contained in FP* Pl 1 . An 
immediate consequence of this result are upward collapse results of the form: the collapse 
#1 • P Q FP implies the collapse #1 • PH C FP. Thus, every P set has an easy census 
function if and only if every set in the polynomial hierarchy has an easy census function. 
Note that the corresponding upward collapse for the # operator applied to the levels of PH 
follows immediately from the upward collapse property of the polynomial hierarchy itself: 
# • P C FP implies NP = P and thus PH = P; so, # • PH = # • P C FP. However, 
for the #1 operator this is not so clear, since the assumption #i • P C FP merely implies 
that all tally NP sets are in P (equivalently, NE = E), from which one cannot immediately 
conclude that ■ NP or even #i • PH is contained in FP. In fact, Hartmanis, Immerman, 
and Sewelson [ HIS85| ] show that in some relativized world, NE = E and yet the (weak) 
exponential-time hierarchy does not collapse. In light of this result, it is quite possible that 
the assumption of all tally NP sets being in P does not force all tally sets from higher levels 
of the polynomial hierarchy into P. 

We also show that the assumption #Pj C FP implies both P = BPP and PH C MOD fc P 
for each k > 2 (Theorem |3.6| ) , which provides further evidence that not all sets in P have a 
census function computable in polynomial time. We also relate a set's property of having 
an easy census function to other well-studied properties of sets, such as rankability GS91] 
and scalability JGH96 ]. In particular, though each rankable set has an easy census function, 
we show that (even when restricted to the sets in P) the converse is not true unless P = PP. 
This expands the result of Hemaspaandra and Rudich that every P set is rankable if and 
only if P = PP |HR90| 1 by showing that P = PP is already implied by the apparently weaker 
hypothesis that every P set with an easy census function is rankable. 

Cai and Hemaspaandra CH86] introduced the notion of enumerative counting as a 
way of approximating the value of a #P function deterministically in polynomial time. 
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Hemaspaandra and Rudich [HR90| show that every P set is fc-enumeratively rankable for 
some fixed k in polynomial time if and only if #P = FP. They conclude that it is no more 
likely that one can enumeratively rank all sets in P than that one can exactly compute their 
ranking functions in polynomial time. In Section ||, we similarly characterize the question 
of whether the census function of all P sets is n a -enumerable in time n@ for fixed constants 
a and (3, or equivalently, whether every function is re Q -enumerable in time rfi . We 
show that this hypothesis implies #Pi C FP, and we thus conclude that it is no more likely 
that one can n a -enumerate the census function of every P set in time "nP than that one can 
precisely compute its census function in polynomial time. 

Finally, Section |5| provides a number of relativization results. 



2 Notation and Definitions 

Fix the alphabet £ = {0, 1}. S* denotes the set of all strings over S, and £+ = £* \ {e}, 
where e denotes the empty string. For any string x € £*, we denote the length of x by 
For any set L C £*, the number of strings in L is denoted \L\, and the complement of L 
in S* is denoted L. Let L =n (respectively, L- n ) denote the set of strings in L of length n 
(respectively, of length at most n). As a shorthand, we use E n to denote (S*) =n . For any 

set L, the census function of L, censusL : S* — > N, is defined by census^ 
and xl denotes the characteristic function of L, i.e., xl{x) = 1 if x G L, and xl{x) = if 
x g" L. A set S is said to be sparse if there is a polynomial p such that for each length n, 
censuss(l n ) < pin). A set T is said to be tally if T C {1}*. To encode pairs of strings, we use 
a one-one, onto pairing function, (•,•) : S* x S* — > £*, that is computable and invertible 
in polynomial time; this pairing function is extended to encode m-tuples of strings as is 
standard. For convenience, we will sometimes write m-tuples of strings x\, x%, ■ ■ ■ , x m € X* 
explicitly as xi#X2# . . . #i m , using a special separating symbol # not in S. We let < 
denote the standard lexicographic order on E*. 

The definition of Turing machines and their languages, Turing transducers and the 
functions they compute, relativized (i.e., oracle) computations, (relativized) complexity 
classes, etc. is standard in the literature (see, e.g., the textbooks |HU79| , [B"C93 , Pap94 1 ) . 



We briefly recall the complexity classes most important in this paper. FP denotes the 
class of polynomial-time computable functions. FPi is the class of functions computable 
in polynomial time by deterministic transducers with a unary input alphabet. FE is the 
class of functions that can be computed by deterministic transducers running in time 2 cn for 
some constant c. Let E = f U c>0 DTIME[2 cn ] and NE = (J c>0 NTIME[2 cra ]. An unambiguous 
Turing machine is a nondeterministic Turing machine that on each input has at most one 



lr The census function of L at n is often denned as the number of elements in L of length up to n in the 
literature. This definition and our definition are compatible as long as our computability admits subtraction. 
We also note that we let census^ map strings l n (as opposed to numbers n in binary notation) to | L =n | to 
emphasize that the input to the transducer computing census^ is given in unary. 
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accepting path. UP | Val76 | (respectively, UE) is the class of all languages accepted by some 
unambiguous Turing machine running in polynomial time (respectively, in time 2 cn for some 
constant c). 

For any nondeterministic Turing machine M and any input x G £*, let accM(^) denote 
the number of accepting paths of M(x). A spanP machine JKST8S ] is an NP machine that 
has a special output device on which some output is printed for each accepting path. For 
any spanP machine M and any input x G £*, spanj^(x) is defined to be the number of 
different outputs of M{x) if M(x) has at least one accepting path, and otherwise. A tally 
NP machine (respectively, a tally spanP machine) is an NP (respectively, a spanP) machine 
with a unary input alphabet. 

Definition 2.1 1. ||Val79a|JVal79b|| #P = {acc M I M is an NP machine}. 



2. |Val79bfl #P X = {acc M I M is a tally NP machine}. 



3. 1KST891 spanP = {span M \ M is a spanP machine}. 



df 



4- spanP 1 = {span M \ M is a tally spanP machine}. 



df 



5. #E = {accM | M is an NE machine}. 



df 



6. [ MS72|JSto77|| The polynomial hierarchy is inductively defined as follows: £q = P, 

Ji>0 ' 



s p I NP s fe-i fork>l, and PH = U i>0 s f- 



7. [Gil77] PP is the class of languages L for which there exist a set A in P and a 
polynomial p such that for all strings x G S* ; 



x G L 



{y | |y| = p(|a?|) and (x,y) G ^4} | > 2P^~ 1 . 



8. [Gil77] BPP is the class of languages L for which there exist a set A in P and a 
polynomial p such that for all strings x G £*, 

x G L | {y | \y\ = p(\x\) and (x, y) G" A} \ < 2 p( -^~ 2 , and 

X ^L | {y | |y| = p(|x|) and (x, y) eA}\< 2 M ~ 2 . 

9. JCH9q , per90| , pG92| For any fixed k > 2, MOD fc P is the class of languages L for 
which there exist a set A in P and a polynomial p such that for all strings x G £*, 

x G L | {y | \y\ = p(\x\) and (x, y) G A} \ ^ mod fc. 

If k = 2, we write 0P (introduced in [)PZ83| , |GP86(1 ) instead o/MOD 2 P. 

10. [OH93,FFK94] SPP is the class of languages L for which there exist a set A in P 



and a polynomial p such that for all strings x G X* ; 

x G L | {y | \y\ = p(\x\) and (x, y) G ^4} 

x L =^> | {y | \y\ = p(\x\) and (x, y) G ^4} 



2P(M)-l + l, and 
2p([z|)-i_ 
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11. [KL8C] For any language class C, let C/poly be the class of all languages L for 
which there exist a set A € C, a polynomial p, and an advice function h : X* — > X* 
such that for each length n, \h(l n )\ = p[n), and for every x G X* ; x G L if and 
only if (x,h{\\ x \)) G A. For any function class T , let JF/poly be the class of all 
functions g for which there exist a function f G T , a polynomial p, and an advice 
function h : X* —> X* such that for each length n, \h(l n )\ = p(n), and for every 
xGX*, g(x) = f({x,h(lW))). 

We will use the common operator notation at times in order to generalize function classes 
such as #P and #Pi- 

Definition 2.2 For any language class C, define 

1. jf ■ C to be the class of functions / : X* — > N for which there exist a set A G C and a 
polynomial p such that for each x G X* ; 

f(x) = \ {y \ \y\ = p(\x\) and (x, y) G A} |, and 

2. #i • C to be the class of functions f : X* — > N for which there exist a set A G C and a 
polynomial p such that for each n G N, 

f(l n ) = \{y\ \y\=p(n) and {l n ,y) G A}\. 

Definition 2.3 1. A bisection <j) : X* — > X* is a P-isomorphism if <p is computable and 
invertible in polynomial time. 

2. A P -isomorphism <j> is length-preserving if for all x G X*, |</>(:e)| = \x\. 

3. A P -isomorphism (f) mapping set ACS* to set B C X* is order-preserving if for any 
two strings x and y satisfying either x,y G A or x,y A, if x < y, then (p(x) < 4>(y). 



Definition 2.4 [GS91] The ranking function of a language A C X* is the function r : X* 



N that maps each x G X* to \ {y < x \ y G ^4} | . A language A is rankable if its ranking 
function is computable in polynomial time. 



Goldsmith and Homer [GH96] introduced the property of scalability, a more flexible 
notion than rankability in which the rank of some given element within the set is not 
necessarily determined with respect to the lexicographic order of X*, but rather with respect 
to any well-ordering of X* that can be "scaled" by a polynomial-time computable and 
polynomial-time invertible bijection between N and X*. Equivalently, the scalable sets are 
precisely those that are P-isomorphic to some rankable set. The definition below is based 
on this characterization. 



Definition 2.5 | GH96 | A language A is scalable if it is P-isomorphic to a rankable set. For 
any oracle X, the X-scalable sets are those that are ~P X -isomorphic to some set rankable 
in FP X . 
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3 Does P Have Easy Census Functions? 



We start with exploring the relationships between the properties of a set being rankable, 
being scalable, and having an easy census function. Let A be any set (not necessarily in P). 
Consider the following conditions: 

(i) A is rankable. 

(ii) A has an easy census function. 

(iii) A is P-isomorphic to some rankable set (i.e., A is scalable). 

(iv) A is P-isomorphic to some rankable set via some length-preserving isomorphism. 

(v) A is P-isomorphic to some rankable set via some order-preserving isomorphism. 

It is immediately clear that for any set A, (i) implies each of (ii), (iv), and (v), and each 
of (iv) and (v) implies (iii). The next proposition shows that the rankable sets are closed 
under order-preserving P-isomorphisms (thus, conditions (i) and (v) in fact are equivalent) 
and that the class of sets having an easy census function is closed under length-preserving 
P-isomorphisms. The latter fact immediately gives that (iv) implies (ii), since each rankable 
set has an easy census function. The inclusion structure of the sets in P satisfying Properties 
(i) through (iv) is given in Figure |]. 

Proposition 3.1 1. The class of all rankable sets is closed under order-preserving P- 
isomorphisms. 

2. The class of sets having an FP- computable census function is closed under length- 
preserving P -isomorphisms. 

Proof, (jlj). Let A be P-isomorphic to a rankable set B via some order-preserving 
isomorphism. Since B is rankable, B is rankable. Let respectively r and f be the ranking 
functions for B and B. For any string x £ £*, let lex(x) denote the lexicographic order 
of x. Define the function 

s df j r(x) if x <E A 

\ lex(x) — f(x) if x A. 

Clearly, r 1 is computable in polynomial time and r' is the ranking function for A. 

(|2|). Let A be P-isomorphic to a set B with censusB G FP via some length-preserving 
isomorphism (p. Then, <j)(A =n ) = B =n . So, for every n, census^ = censuss- This implies 
census b € FP. | 

So we are left with only the four conditions (i) to (iv). Since there are nonrecursive sets 
with an FP-computable census function, but any set satisfying one of (i), (iii), or (iv) is 
in P, condition (ii) in general cannot imply any of the other three conditions. On the other 
hand, when we restrict our attention to the sets in P having easy census functions, we can 



6 




P-isomorphic to some rankable set 
via some length-preserving isomorphism 



rankable 



Figure 1: Inclusion structure of the sets in P satisfying Properties (i) through (iv). 
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show that (ii) implies (i) if and only if P = PP. Thus, even when restricted to P sets, it is 
unlikely that (ii) is equivalent to (i). 



Theorem 3.2 All P sets with an easy census function are rankable if and only if~P = PP. 

Proof. Hemaspaandra and Rudich show that P = PP (which is equivalent to P# p = P) 
implies that every P set is rankable [ HR90| ]. Conversely, let L be any set in PP, and let A 
be a set in P and p be a polynomial such that for all x G S*, 

x G L | {y | \y\ = p(\x\) and x#y G ^4} | > 2 p(|x ' l) ~ 1 . 

Define 

T = {b#x#y | x,y G £*, \y\ = p(\x\), b G {0, 1}, and X A(x#y) = b}. 

Clearly, T G P. Also, the census function of T is easy to compute: Given n in unary, 
compute the largest integer i such that i + p{i) + 3 < n. Then, 



censusT(l n ) 



2 i+P(i) if i + p (i) + 3 = n 
if i +p{i) + 3 < n. 



Since T G P and censusT G FP, by hypothesis T is rankable. Let r be the ranking function 
for T. Since for each x G S + , 

x£L^ r(0#x#l p ^l)) - r(l#£#l p d £ D) < 2 P ^~ 1 , 

where x is the lexicographic predecessor of x, and since the predicate on the right-hand side 
of the above equivalence can be decided in polynomial time, it follows that L G P. | 

Corollary 3.3 All P sets are rankable if and only if all sets in P with an easy census 
function are rankable. 

One might ask whether or not all P sets outright have an easy census function (which, 
if true, would make Corollary |3.3| trivial). The following characterization of this question 
in terms of unlikely collapses of certain function and language classes suggests that this 
probably is not true. Thus, Corollary |3.3j is nontrivial with the same certainty with which 
we believe that for instance not all #P^ functions are in FP.0 

Theorem 3.4 The following are equivalent. 

1. Every P set has an FP -computable census function. 

2. #p 1 c FP. 

2 It is not difficult to construct — by standard techniques — an oracle relative to which =ffP 1 % FP. On the 
other hand, we will show in Section ^| that, relative to some oracle, #P X C FP, yet #P 7^ FP (and thus 
PP/P). 
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3. #E = FE. 
I P# p i = P. 

5. For every language L accepted by a log space-uniform depth 2 AND- OR circuit family 
of bottom fan-in 2, censusi is in FP. 

Proof. To show that (|l|) implies (||), let / be any function in #P 1 . Let M be some tally 
NP machine with accAf = /. Assume that M runs in time n k , for some constant k. Define 

A = {x | = n k for some n and x encodes an accepting path of M(l n )}. 

Clearly, A is in P (note that n can be found in polynomial time, since computing the kth 
root of some integer can be done in polynomial time). Now from our hypothesis it follows 
that census a is in FP, and since census a = accAf, we have / € FP. 

Conversely, let A be an arbitrary set in P. Define M to be the tally NP machine that, 
on input 1™, guesses an x € {0, l} n , and for each x guessed, accepts along the path for x if 
and only if x 6 A. Then, accA/ = census a- Since by hypothesis accAf £ FP, it follows that 
census a € FP. 

The equivalence of (|2|) and (||) can be proven by means of standard translation — this is 
essentially the function analog of Book's result that every tally NP set is in P if and only 
if NE = E | Boo74 | (see |Har85 . HIS85 for the extension of this result to sparse sets). 



The equivalence of (||) and (|4|) is straightforward. 

It is easy to see that (|2|) implies (||). In order to prove that (0) implies (|2|), note that 
computing the number of satisfying assignments for monotone 2CNF formulas is complete 



for ^tP [Val79b] under logspace reductions. Now, given a function / in #P lf there exist 
logspace computable functions R, S, p such that for all n, R(l n ) is a monotone 2CNF formula 
with p(l n ) variables, and f(l n ) equals the number of satisfying assignments for R(l n ) 
divided by S(l n ). The reduction R can be modified so that for every n, p(l n+1 ) > p(l n ). 
Now let C m be the circuit defined as follows: (a) if m = p(l n ) for some n, then C m is 
a depth 2 AND-OR circuit that tests whether an assignment, given as the input, satisfies 
R(l n ), and (b) if not, C m is a depth 1 AND circuit that rejects all inputs. This circuit 
family F = {C m } is logspace-uniform. Now let A be the language accepted by F. Then, 
for every n, /(l n ) = census A {l p{in) ) / S{l n ). Thus, © implies that / £ FP. | 



Theorem |3.4j can as well be stated for more general classes than #Pi = #i • P. In 
particular, this comment applies to ■ C, where for instance C = NP or C = PH. Noticing 



that spanPj^ = ■ NP and focusing on the first two conditions of Theorem ^J, this 
observation is exemplified as follows. 

Theorem 3.5 1. Every NP set has an FP -computable census function if and only if 
spanPi C FP. 

2. Every set in PH has an FP '-computable census function if and only ifff\ ■ PH C FP. 
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We will show later that the conditions of Theorem 3.4 in fact are equivalent to the two 



conditions stated in either part of Theorem 3.5. Next, we give some more evidence that the 
collapse #Pi Q FP is unlikely to hold. 

Theorem 3.6 // #P 1 C FP, then the following holds: 

1. For any fixed k > 2, PH C MOD fc P, and 

2. P = BPP. 



Proof. For the first part, notice that Toda and Ogihara [F092| show that for each k > 2 
and any set L, if L £ PH, then L £ MOD^P/poly with an advice computable in (the 
function analog of the language class) PH MODfeP . Also, they show that for every k > 2, 
pjjMOD ft p q p#P[i] ) w here the [1] in the superscript indicates that on every input at most 
one call to the #P oracle is allowed. Thus, the advice function for L is in FPf P[1] . Fix 
k > 2 and L £ PH, and take an advice function / £ FP* P M that puts L into MOD^P/poly. 
Let T be the polynomial-time oracle transducer with function oracle g £ #P that witnesses 
/ £ FPf P[1] . W.l .o.g., assume that T makes exactly one oracle call on each input (by 
asking a dummy query if necessary). Define the function g± that, on input l n , returns 
the value g(q n ), where q n is the one query string computed by T on input l n . Thus, / in 
fact is computable in FP 1 1 and so, by our supposition, in polynomial time. Since L 
is in MODfcP/poly with polynomial-time computable advice, it follows that L £ MOD^P. 
Hence, PH C MOD fc P. 

In order to prove the second part, notice that BPP is in P/poly ||Adl7§| 1 with an advice 
computable in (the function analog of) PH | Sip83 , Lau83 ], and that PH C P# P W by Toda's 



Theorem |rod9lj . An argument similar to the above shows that P = BPP. 



Now we show that the conditions of Theorem 3.4 in fact are equivalent to the two 

To this end, we establish the following 



conditions stated in either part of Theorem 3.5 



theorem, which is interesting in its own right. Theorem |3.7| is the main technical contribution 
in this section. 



Theorem 3.7 #P PH C FP #p i 



#Pi 



Remark 3.8 1. Note that Toda's result PH C P# P W [ p?od91[| immediately gives that 
#P PH C #P* p K an d ^pPH c #pf p[1] . Observe that the oracle is a #P function. 



Theorem 3.7 establishes containment 



In contrast to the inclusion #P PH C #P# P[1] , 
of #P PH in a class in which only #P^ oracles occur. Though our proof also applies 
the techniques of [Tod91,r092], the result we obtain seems to be incomparable with 
the above-mentioned immediate consequence of Toda's Theorem. 



It is unlikely that Theorem 3.7 can be extended to FP PH or even #P PH being contained 



in FP* Pl 1 , since this would imply that FP PH C FP/poly and thus, in particular, 
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would collapse the polynomial hierarchy. In contrast, the inclusion FP PH C FPi/poly 
that does follow from (the proof of) Theorem ^O] merely implies that all tally sets in 
PH have polynomial-size circuits, a true statement that has no unlikely consequences. 

3. The proof of Theorem in fact establishes a more general claim. Since ©P PH /poly = 
©P/poly |TQ92| , Theorem 3/7 and its corollaries can be stated even with PH replaced 



by ©P PH (note that ©P PH = BPP® P by Toda's result |Tod91j| ). However, we focus 



on the PH CclS6 ? ELS this is a more natural and more central class. 

Proof of Theorem |7f|. Let / be any function in #P PH . Note that #P PH = #i - PH, since 
PH is closed under Turing reductions. Thus, there exist a set L G PH and a polynomial p 
such that for each length n, f(l n ) = \ {y G {0, l} p ( n ) | \ n j^y G L} |, where for convenience 
we assume that p{n) is a power of 2 for each n. By Toda and Ogihara's result that PH C 
©P/poly TQ92|, there exist a set A G ©P, an advice function h : S* — > £*, and a 



polynomial q such that for each length m and each x of length m, \h(l m )\ = q(m), and 
x G L if and only if (x, h(l m )) G A. Let M be a machine witnessing that A G ©P, i.e., for 
every string z, z G A if and only if &ccm{z) is odd. 



Toda [Tod91| defined inductively the following sequence of polynomials: For j G N, let 



so(j') = j, and for each j G N and « > 0, let 

Si (i)l3( Si _ 1 (i)) 4 + 4( Si _ 1 (i)) 3 . 

One very useful property of this sequence of polynomials is that for all i, j € N, Sj(j') = c- 2 2 ' 
for some c G N if j is even, and Sj(j) = d ■ 2 2 * — 1 for some <i G N if j is odd (see [fTod91 1 for 
the induction proof). 

We describe a polynomial-time oracle transducer T that, on input 1 , invokes its #PJ 1 
function oracle g and then prints in binary the number f(l n ). Fix the input l n . First, T 
transfers the input to the oracle g. Formally, function g is defined by 

i £ (^ i (acc M ((l n #y,/ l (l n+1+p( " ) ))))) 2 , 

df 

where £ n = logp(n). 

Informally speaking, that g is in #P X 1 follows from the properties of the Toda 
polynomials, from the closure of #P under addition and multiplication, and from the fact 

-li- ~p M 1 -Ji. pt 

that advice function h is computable in FP X 1 . More formally, to show that g G #Pi 1 , 
we describe a tally NP oracle machine G and a #Pi oracle g\ for G such that, for every n, 
the number of accepting paths of G on input l n with oracle g\ equals g(l n )- On input 1", 
G first gets the advice string a n = /j(l n + 1+ P( n )) of length q(n + 1 + p(n)) via one call to 
some appropriate #Pi oracle, say g±. This is possible by the argument given in the proof 



Indeed, P/poly is known to contain all tally sets and even the Turing closure of the sparse sets. 
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of Theorem 3.£, where g\ is described. Then, G guesses all strings y of length p(n) and for 
each y guessed proceeds as follows. For fixed y, let j y be a shorthand for accM((l n #2A o- n )). 
Then, {s£ n (j y )) 2 is a polynomial of degree 2 2fn+1 , which is polynomial in n. Also, the 
coefficients of this polynomial are deterministically computable in time polynomial in n 
(see [ Tod91| ). Since accM £ #P and #P is closed under addition and multiplication, the 



function mapping {l n #y, a n ) to (s£ n (j y )) 2 is in #P. Let G be an NP machine witnessing 
that this function is in $=P. Then, G on input l n can for each guessed y produce exactly 
(s£ n (j y )) 2 accepting paths by simulating G on input (l n #y, a n ). Again using the closure of 
#P under addition, it follows that g E #P X , as claimed. 

By the above properties of the Toda polynomials, it follows that for each y of length p(n), 
if j y is even, then sg n (j y ) = c • 2 2 " for some c € N, and if j y is odd, then S£ n (j y ) = d ■ 2 2 n — 1 
for some d G N. Thus, recalling that 2^ n = p(n), we have 

j y is even ( S , n (j,)) 2 = (c 2 • 2^- 1 )2pW+ 1 , and 

is odd (^ n (j J/ )) 2 = (d 2 -2^)- 1 -d)2^) +1 + l. 

Defining the integer-valued functions c(n) = c 2 • 2 P ^~ 1 and d(n) = d 2 ■ 2 P ^~ 1 — d, we 
obtain 



c(n) • 2 p ( n ) +1 if is even 
d(n) ■ 2 p ( n ) +1 + 1 if j y is odd. 



Thus, since /(l n ) < 2 p ^ n ) and since j y is odd if and only if l ra #y G L, the rightmost p(n) + 1 
bits of the binary representation of g(l n ) represent the value of /(l n ). Hence, after the 
value g(l n ) has been returned by the oracle, T can output /(l n ) by printing the p(n) + 1 
rightmost bits of g(l n ). This completes the proof. I 



since #P X C FP implies FP #p i C FP, we have from Theorem 13.71 the following 



corollary. 

Corollary 3.9 #P 1 C FP if and only if #Pf H C FP, and in particular, ifP\ Q FP if and 
only i/spanPx C FP. 



Corollary 3.£ together with the equivalences of Theorems 3.4 and gives the following 



Corollary 3.10 Every P set has an easy census function if and only if every set in PH has 
an easy census function. 



Kobler et al. [KST89| proved that spanP = #P if and only if NP = UP. Their proof 



also establishes the analogous result for tally sets: 



Lemma 3.11 (implicit in [ KST89[ ]) spanP-L = #P^ if and only if every tally NP set is 
in UP. 
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Using Lemma 3.11, we show that spanP 1 and #Pi are different classes unless NE = UE, 
or unless every sparse set in NP is low for SPP. A set S is said to be C-low for some class 
C if C s = C (see, e.g., |pdi83|jKS85|J5ch87|jKSTf92| for a number of important lowness 
results). In particular, it is known that every sparse NP set is low for P NP 1KS85| and for 
PP |[KSTT92| , but it is not known whether all sparse NP sets are low for SPP. Toran's 
result that in some relativized world there exists some sparse NP set that is not contained 



in ©P | Ibr88 l, and thus not in SPP, may be taken as some evidence that not all sparse 
NP sets are SPP-low. Since Corollary |3,12 relativizes, spanP 1 ^ holds relative to the 
same oracle. 

Corollary 3.12 If spanP l = #P 1; then 

1. NE = UE and 

2. every sparse NP set is low for SPP. 

Proof. The hrst part follows from a standard upward translation argument (as mentioned 
in the proof of Theorem . 

For the second part, assume spanP x = #P X , and let S be any sparse set in NP. Clearly, 
S polynomial-time truth-table reduces to some tally NP set T. By Lemma pi), our 
assumption implies that T e UP, and thus T e SPP. Since P SPP = SPP, S £ SPP. 
The result now follows from the self-lowness of SPP [FFK94]. | 



4 Enumerative Approximation of Census Functions 



Cai and Hemaspaandra [pH8£ ] introduced the notion of enumerative counting as a way 
of approximating the value of a #P function deterministically in polynomial time. 



Definition 4.1 |CH89| ] Let f : E* -> S* and g : N -> N be two functions. A Turing 
transducer E is a (7(n)-enumerator of f if for all n € N and x G S n , 

1. E on input x prints a list C x with at most g{n) elements, and 

2. f(x) is a member of list C x . 

A function f is g(n)-enumerable in time t(n) if there exists a g(n)- enumerator of f that 
runs in time t(n). 

A set is g(n)-enumeratively rankable in time t(n) if its ranking function is g(n)- 
enumerable in time t(n). 

Recall from the introduction Hemaspaandra and Rudich's result that every P set is k- 
enumeratively rankable for some fixed k (and indeed, even C(n 1//2_<E )-enumeratively rankable 
for some e > 0) in polynomial time if and only if #P = FP |HR90| 1 . They conclude that it 
is no more likely that one can enumeratively rank all sets in P than that one can exactly 
compute their ranking functions in polynomial time. We similarly characterize the question 
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of whether the census function of all P sets is n a -enumerable in time n' 3 for fixed constants 



a and j3. By the argument given in the proof of Theorem 3.4, this is equivalent to asking 
whether every #Pi function is n a -enumerable in time n@ . We show that this implies 
C FP, and we thus conclude that it is no more likely that one can n a -enumerate the 
census function of every P set in time vP than that one can precisely compute its census 
function in polynomial time. It would be interesting to know if this result can be improved 
to hold for polynomial time instead of time t for some fixed polynomial t(n) = n@. 

Theorem 4.2 Let a, (3 > be constants. If every #P X function is n a - enumerable in time 
nP, then #P X C FP. 



Proof. Cai and Hemaspaandra [CH91] show that for any fixed k, if #SAT (the function 
mapping any boolean formula / to the number of satisfying assignments of /) is n k - 
enumerable, then #P C FP. In order to prove this, they develop the following protocol for 
computing the permanent of an m x m matrix A, given as parameters (the encoding of) 
a polynomial-time transducer E (the enumerator for #SAT), and a prime number p: Set 
Aq = A to the input matrix and repeat the following steps for i = 1, . . . ,m — 1: 

1. Construct from ^4j_i an (m — i) X (to — i) matrix Bi(X), defined by 

m—i 

Bi(X) i e k(X)a lk A\ 



df ^ n {Y ^^ A (l,k) 
" -1 ' 

k=l 



where e&(X) is a degree (m — i) polynomial in X such that e^^X) = 1 if X = k 
and otherwise, a\k is the (1, k) entry of and A^'^ is the (1, /c)-minor of 

Each matrix is viewed as a matrix over Z/pZ, that is, the matrix entries are reduced 
modulo p. Then the following conditions hold. 

• Each entry of Bi(X) is a degree (m — i) polynomial in X with coefficients in 
{0, . . . ,p — 1}, so perm(Bi(X)) is a degree (m — i) 2 polynomial in X. 

• J2T=i P erm ( B i( k )) = perm^-i). 

2. Encode Bi(X) into a binary string specifying in binary p, m, and the coefficients of 
Bi(X). There is some fixed constant c > such that the encoding length is at most 
c(m — i) 3 logp. Define Qi(X) = perm(i?j(X)). Then, Qi is a polynomial of degree at 
most (m — i) 2 , whose coefficients are each length-bounded by a fixed polynomial in 
p and m. Thus, there is a #P function G that maps Bi(X) to a number from which 
the coefficients of Qi can be decoded in polynomial time. 

3. Use E as an enumerator for G to obtain candidates gx, ■ ■ ■ ,gt- These are all degree 
(m — i) 2 polynomials that are pairwise distinct. Since two distinct degree (m — i) 2 
polynomials can agree at no more than (m — i) 2 — 1 points, there are fewer than 
t 2 (m — i) 2 < t 2 m 2 — 1 points X at which any two candidate polynomials agree. Thus, 
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if p > t 2 m 2 , then there is an r € {0, ... ,p — 1} such that gj(r) ^ gk(f) for all j ^ k. 
Take the smallest such r and set Aj to Bi(r) with the entries reduced modulo p. Now, 
perm(Ai) modulo p specifies which gj is correct, so we can recover perm(ylj_i) modulo 
p in polynomial time. 

At the end of this loop, A m is a 1 x 1 matrix, so its permanent is easy to compute. 
Now working backwards again, we can recover perm(A) modulo p. If we do this for 
polynomially (in the encoding length of A) many distinct primes, then by the Chinese 
Remainder Theorem, we can recover the exact value of perm(A). 

Valiant [|Val79a | showed that the permanent of matrices whose entries are from the set 



{—1,0,1,2} is complete for #P. Analogously, we can show that there exists an infinite 
sequence of matrices [M±,M2, • • • ] such that (i) the mapping 1" — > perm(M n ) is complete 
for #P 1; (ii) the mapping l n — ► M n is polynomial-time computable, and (hi) for every n, 
M n is an n x n matrix whose entries are from {—1,0, 1, 2}. Because of (iii), perm(M n ) < 2 2n 
for all n. So, by the Chinese Remainder Theorem, for every n, the exact value of perm(M n ) 
can be computed from perm(M n ) modulo p for 2n arbitrary distinct primes p. Define 
polynomials q and s by q(n) = (n,n,n,2n) and s(n) = q(n) 2a n 2 . Define the function / 
from the tally strings to the set of natural numbers as follows. 

• If m = (H,n,i,j) for some H, i < n and j < 2n, then /(l m ) is G{Bi(X)) defined in 
the above protocol when we simulate the protocol under the following constraints: 

— The jth smallest prime > s(n) is used in place of p. 

— M n is used in place of the input matrix ^o- 

— H is viewed as (the encoding of) a Turing transducer and is used in place of the 
enumerator E. Here, for each k with 1 < k < % — 1, the input given to H in the 
kth round of the protocol is (H, n, k,j), not the matrix A^. Also, H is supposed 
to run in q(n)^ steps and generates at most q(n) a candidates in each round. If 
H does not halt in q(n)^ steps or generates more than q(n) a candidates at any 
point of the simulation, then the simulation is immediately aborted and the value 
/(l m ) is set to 0. 

• If m is not of the above form, /(l m ) is 0. 

This function / is in ^P^. First, there are only i < m rounds to be simulated and each 
round requires m a steps for candidate generation and some polynomial (in n) number of 
steps for other computations. Second, by the Prime Number Theorem, the 2nth smallest 
prime > n is 0(n), so finding the jth smallest prime > s(n) requires only a polynomial 
number of steps. 

Now, by our assumption, there is an m a -enumerator E for / that runs in time mP . 
Since the number of candidates that E generates is at most m a and the dimension of the 
matrix M n is n, we have a prime > m 2a n 2 . This implies that with E as the enumerator, 
for every n > E, every j, 1 < j < 2n, and every i, 1 < i < n, we successfully find an r for 
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distinguishing the candidates. So, with E as the enumerator, for all n > E, perm(M n ) is 
polynomial-time computable. Hence #Pi C FP. | 

5 Oracle Results 

In this section, we provide a number of relativized results on the existence or 
non-existence of P sets simultaneously satisfying pairs of conditions chosen among the 
properties (i), (ii), and (hi) from Section H For instance, Theorem [0| and its Corollary lk2 



below exhibit a relativized world in which every P set has an easy census function 
(Property (ii)), yet there exists some set in P that is not rankable (Property (i)). 



Theorem 5.1 There exists an oracle D such that #Pf C FP-° ^ #P D 



From the relativized versions of Theorem 3.4 and of Hemaspaandra and Rudich's result 



in [ HR90 | that every P set is rankable if and only if P# p = P (which is equivalent with 
FP = #P, and this equivalence itself also relativizes), we immediately obtain the following 
corollary. 

Corollary 5.2 There exists an oracle D such that all sets in P D have a census function 
computable in FP D , yet there exists some set in P D that is not rankable by any function 
in FP D . 



Proof of Theorem 5.1. Balcazar et al. PBS86 ] and Long and Selman | ]LS86j ] proved 



that the polynomial hierarchy does not collapse if and only if it does not collapse relative to 
every sparse oracle. Since their proof relativizes (i.e., it applies to the relativized polynomial 
hierarchy as well), we have the following claim: 



Claim 5.3 ggggglggg For every set B, PH B does not collapse if and only if for every 



sparse oracle S, (PH B ) S does not collapse. 

Note that (PFP 3 ) 5 = PIP 9 ® 5 , where X © Y = {Ox | x £ X} U {ly \ y G Y} denotes 
the join of any two sets X and Y. Fix an oracle A such that PFL 4 does not collapse (such 

oracles were constructed by Yao [ Yao85| ], Hastad [ Has89 1 , and Ko [ Ko8S| ] who built on the 

jA®S 



work of Furst et al. [FSS84]). Then, by Claim 5.3 above, for every sparse set S, PH 



does not collapse. So, in particular, p^ 4 © 5, ^ NP^© 5 for every sparse set S. Since for every 
oracle B, #P B = FP B implies NP B = P B , we have that #P A © 5 ^ Fp A®s for every sparse 
set S. 

So it remains to prove that there exists a sparse set T such that Q FP w . 

Then, setting D = A © T completes the proof. 

Assume that our pairing function (•,-,•) is nondecreasing in each parameter, polynomial- 
time computable and invertible, and is one-to-one and onto. Let n[ \n% , • • ■ be a standard 
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enumeration of all tally NP oracle machines. For each i > 1, let pi be the polynomial time 
bound of . Then, the function /W defined by 

/ (-)(i(w>) I J acc jv«( in ) if ^H<i 

I otherwise 

is a canonical function complete for the class #P^.^] In particular, for every fixed set S, 
j(A(BS) is com pi ete f or #p*BS 

The oracle set T is defined in such a way that, for any given m = (i, n,j) in unary, some 
polynomial-time oracle transducer can retrieve the value of /(^® T )(l m ) from its oracle A®T 
by asking at most m queries. More formally, we construct T in stages such that for each 
m = (i,n,j): 

l k O m ~ k #b € T ^ 1 < k < \ f A ® T )(l m )\ and the fcth bit of f( A ® T )(l™) is b. 

Since by the above definition, |j( j4 ® T )(l m )| < rn and so, in particular, N A ® T (l n ) cannot 
query strings of length > m, there is no interference between the stages of the construction 
of T. It is easy to see that T is a sparse set satisfying #P^® T C ~PP A ® T . | 

Now we construct an oracle relative to which there exists some scalable set in P whose 
census function is not easy to compute. 

Theorem 5.4 There exists an oracle A such that there exists an A-scalable set B whose 
census function is not in FP" 4 . 

Proof. We will construct A and B in such a way that B is P^-isomorphic to the set 
R = {Ox | x € S*}, which is rankable in FP (and thus in FP 71 ). For each n > 1, we have 
censusji(l n ) = 2 n_1 . So census^ is easy to compute, but we want B to have a hard census 



function. In light of Proposition 3.1.|, we thus need the isomorphism, /, between B and R 
be non-length-preserving. In particular, we will define / so as to satisfy |/(x)| < |x| + 1 and 
< \y\ for all x,y G £*. When / is defined, we let B be the set f~ 1 (R). To have / 
and its inverse computable in FP" 4 , we encode / and f^ 1 into A = Aj Af-i as follows. 
For all x G E*, i > 1, and b E {0, 1}, we ensure that 

(1) (x, i, b) £ Af* <^=> the ith bit of f*(x) is b, 

where /* stands for either / or / — 1 . At the same time we diagonalize against FP" 4 so as to 
ensure census^ FP" 4 . 

Let ri , j . . . be a standard enumeration of all deterministic polynomial-time oracle 
transducers, and let pi,P2> • • • be a sequence of strictly increasing polynomials such that pi 
bounds the running time of Tj (independent of the oracle used). By (|l|) above, implicit in 



See [ Val79b[ for natural T^Pj^-complete functions. 
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the definition of / and / is the definition of A, so it suffices to construct the isomorphism. 
The construction of / and / _1 is in stages. By the end of stage i, f will have been defined 
for all strings of length up to r(i), where r will be determined below. Initially, we start with 
r(0) = 0, and we define /(e) = e. Stage i > of the construction is as follows. 

Stage i: Choose rii to be the smallest integer such that rii > r(i — 1) and Pi(rii) < 2 n *~ 2 . 
Let A' be the subset of A that has been decided by now. We want to define / so 
that, eventually, T^{l Ui ) + census B(l ni )- Simulate Tf on input 1"\ Whenever in 
this simulation a string of the form 0(x, i, b) whose membership in A has not yet been 
decided is queried, we add this string to A' and set the ith bit of f(x) to b unless we 
have already put 0(x,i, 1 — 6) into A (and thus have set this bit to 1 — b), or unless 
i > \x\ + 1. The same comment applies to query strings l(y,j,b) whose membership 
in A has not been decided yet and which may fix the jth bit of f~ 1 (y)- If we added 
the queried string to A' , we continue the simulation in the "yes" state; otherwise, in 
the "no" state. In this way, the simulation of (l ni ) may determine / (and 
on at most Pi(rii) < 2 n '~~ 2 bits of the strings of length rij. Thus, for no m > rij is 
f^ 1 determined on all strings of length m in R or R. Once the value T^' (l rii ) is 
computed, there is room to decide f(x) and / _1 (y) for all strings x and y of lengths 
between r(i — 1) and Pi(rii) so that / is an isomorphism mapping to U^— an< ^ 

such that censusB(l n ') ^ Tf- \l ni ), without changing the output value of (l ni ). 
Finally, define r(i) =pi{rii). | 



Next, we provide an oracle relative to which there exists some set in P that is neither 
scalable nor has an easy census function. 

Theorem 5.5 There exists an oracle D such that D G P D is not D-scalable and its census 
function is not in FP D . 

Proof. This is a simple interweaving of two diagonalizations. The only question is how to 
construct a non-scalable set. 



It is known from the work of Goldsmith and Homer |GH96] that any sparse set is scalable 
if and only if it is rankable, and this holds if and only if it is P-printable.^ D will be sparse, 
with at most 2 strings at each length. We assume that (T^ ^)i>i enumerates FP('\ and that 

^ runs in time n 1 . 

At stage 2i, we guarantee that Tf(l n ) does not compute the rank of l n in D, where 
n is chosen large enough that n l < 2 n . For this n, we put l n into D. Compute i; D (l n ), 
restraining any oracle strings of length > n that it queries. By our choice of n, this does 
not decide D =m for any m > n, so we can then put in the appropriate number of strings of 
length n for the diagonalization. 



A set is P -printable [HY84 if there exists a polynomial-time transducer T such that for each length n, 



T on input l n prints a list of all elements of the set up to length n. 
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At stage 2i + 1 we guarantee that Tf (l n ) does not compute the census function of D, 
where n is chosen large enough that n % < 2 n . Again, compute T^(l n ), restraining any 
oracle strings of length > n that it queries. By our choice of n, this does not decide D =m 
for any m > n, so we can then put in the appropriate number of strings of length n for the 
diagonalization. | 

Finally, we show that relative to an oracle, there exists some non-scalable set in P having 
an easy census function. 

Theorem 5.6 There exists an oracle A such that A 6 P is not A-scalable and its census 
function is in YY >A . 

Proof. We construct the oracle A so that A has one string of each length. For those lengths 
for which nothing else is decided, we put in l n . Otherwise, we do the following. 

To make the oracle A non- A-scalable, we actually make it non-P^-printable. At stage i, 
choose an appropriate length n, and then compute T^(l n ). Whenever it queries a string 
of length > n, restrain the string from the oracle. If it does anything except print out 
A- n , then put in the first unrestrained string of each length. If it correctly prints A up to 
length n, then choose an x of each relevant length to include that neither is restrained nor 
printed. I 

We conclude this section with a remark on a technical difficulty in proving the following 
statement: "There exists an oracle E such that all sets in V E have a census function 
computable in FP E , but E € P E is not ^-scalable." Call this statement (S). One might 
hope to prove (S) by exploiting again the fact that scalability, rankability, and P-printability 
are equivalent properties on the sparse sets |GH96| 1 , which was useful in the proofs of 



Theorems and Now, replacing in (S) non-scalability by non-rankability makes (S) 
the following stronger version of Theorem |5.l| : "There exists a sparse set E such that 
#Pf C FP E + #P S ." However, since the oracle D = A 01 constructed in the proof 
of Theorem |5.1| inherently is a nonsparse set due to its A part (and it cannot be made 
sparse unless one could separate the unrelativized polynomial hierarchy [ LS86 , BBS86| ), this 



approach does not work to prove (S). Therefore, to prove (S), one would need to construct a 
nonsparse set E with the desired properties, and we leave this as an interesting open issue. 
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